good initialization
Lottery Tickets on a Data Diet: Finding Initializations with Sparse Trainable Networks
A striking observation about iterative magnitude pruning (IMP; Frankle et al. 2020) is that, after just a few hundred steps of dense training, the method can find a sparse sub-network that can be trained to the same accuracy as the dense network. However, the same does not hold at step 0, i.e., at random initialization. In this work, we seek to understand how this early phase of pre-training leads to a good initialization for IMP, both through the lens of the data distribution and through the loss landscape geometry. Empirically, we observe that, holding the number of pre-training iterations constant, training on a small fraction of (randomly chosen) data suffices to obtain an equally good initialization for IMP. We additionally observe that by pre-training only on "easy" training data, we can decrease the number of steps necessary to find a good initialization for IMP compared to training on the full dataset or a randomly chosen subset. Finally, we identify novel properties of the loss landscape of dense networks that are predictive of IMP performance, showing in particular that more examples being linearly mode connected in the dense network correlates well with good initializations for IMP. Combined, these results provide new insight into the role played by the early phase of training in IMP.
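To make the prune-and-rewind loop the abstract refers to concrete, here is a minimal PyTorch-style sketch of IMP with weight rewinding in the spirit of Frankle et al. (2020). It is an illustrative sketch, not the paper's code: the helper `train_fn` and the knobs `k_steps`, `train_steps`, `rounds`, and `prune_frac` are hypothetical names introduced here.

```python
import copy
import torch

def imp_with_rewinding(model, train_fn, k_steps, train_steps, rounds, prune_frac=0.2):
    """Sketch of iterative magnitude pruning (IMP) with weight rewinding.

    `train_fn(model, num_steps, mask)` is assumed to run `num_steps` of SGD,
    keeping weights zeroed wherever `mask` is zero, and return the model.
    """
    # Phase 1: pre-train the dense network for k steps and save the
    # resulting weights (the "rewind point" theta_k).
    model = train_fn(model, num_steps=k_steps, mask=None)
    rewind_state = copy.deepcopy(model.state_dict())

    # Start fully dense: an all-ones mask per parameter tensor.
    mask = {n: torch.ones_like(p) for n, p in model.named_parameters()}

    for _ in range(rounds):
        # Phase 2: train the masked (sparse) network.
        model = train_fn(model, num_steps=train_steps, mask=mask)

        # Globally prune the smallest-magnitude surviving weights.
        scores = torch.cat([(p.abs() * mask[n]).flatten()
                            for n, p in model.named_parameters()])
        surviving = scores[scores > 0]
        k = max(1, int(prune_frac * surviving.numel()))
        threshold = torch.kthvalue(surviving, k).values
        for n, p in model.named_parameters():
            mask[n] = mask[n] * (p.abs() > threshold).float()

        # Rewind surviving weights to their values at step k, then repeat.
        model.load_state_dict(rewind_state)

    return model, mask
```

In these terms, the abstract's observation is about the choice of `k_steps`: rewinding to step 0 (random initialization) yields sub-networks that train poorly, while a short dense pre-training phase of a few hundred steps, even on a small or "easy" subset of the data, already provides a good rewind point.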
- North America > United States > California > Santa Clara County > Palo Alto (0.04)
- North America > United States > California > Santa Clara County > Stanford (0.04)
- North America > United States > California > Santa Clara County > Mountain View (0.04)
- Europe > Spain > Catalonia > Barcelona Province > Barcelona (0.04)
- North America > United States > Minnesota (0.04)
- North America > Canada > Quebec > Montreal (0.04)
- Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
- Asia > Middle East > Israel (0.04)
We thank the reviewers for their comments. All reviewers agree that our theoretical results are solid and well-explained; one reviewer remarks that "performing a clustering on the initial models should already give the right clusters". Below we first address the common questions, then the more specific questions raised by each individual reviewer.

2 Common Questions

Experiments. We observe that the following common questions about experiments were raised by the reviewers. "Was a separate dataset used for parameter evaluation?": We choose the hyperparameters within a wide range.
Density Estimation via Discrepancy Based Adaptive Sequential Partition
Given i.i.d. observations from an unknown absolutely continuous distribution defined on some domain Ω, we propose a nonparametric method that learns a piecewise constant function to approximate the underlying probability density function. Our density estimate is a piecewise constant function defined on a binary partition of Ω. The key ingredient of the algorithm is the use of discrepancy, a concept originating in quasi-Monte Carlo analysis, to control the partition process. The resulting algorithm is simple, efficient, and has a provable convergence rate. We empirically demonstrate its efficiency as a density estimation method. We also show how it can be utilized to find good initializations for k-means.
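As a rough illustration of the partition scheme, the sketch below recursively bisects a cell whenever the points inside it look far from uniform, and otherwise emits a constant-density leaf. It is a simplification, not the paper's algorithm: a per-coordinate Kolmogorov-Smirnov statistic stands in for the star discrepancy, and `theta`, `min_pts`, and `max_depth` are hypothetical tuning parameters.

```python
import numpy as np

def dsp_sketch(points, lo, hi, n_total, theta=0.05, min_pts=20, depth=0, max_depth=12):
    """Return leaves (lo, hi, density) of a binary partition of the box [lo, hi]."""
    vol = float(np.prod(hi - lo))
    dens = len(points) / (n_total * vol)  # piecewise constant estimate on the cell
    if len(points) < min_pts or depth >= max_depth:
        return [(lo, hi, dens)]
    # Rescale the cell's points to the unit cube and measure, per coordinate,
    # how far their empirical CDF is from Uniform(0, 1); this is a cheap
    # stand-in for the star discrepancy used in the paper.
    u = (points - lo) / (hi - lo)
    ks = np.empty(points.shape[1])
    for j in range(points.shape[1]):
        s = np.sort(u[:, j])
        grid = np.arange(1, len(s) + 1) / len(s)
        ks[j] = np.abs(s - grid).max()
    if ks.max() <= theta:
        return [(lo, hi, dens)]  # looks uniform enough: stop splitting
    # Bisect along the least-uniform coordinate.
    j = int(np.argmax(ks))
    mid = 0.5 * (lo[j] + hi[j])
    left = points[:, j] < mid
    hi_l, lo_r = hi.copy(), lo.copy()
    hi_l[j], lo_r[j] = mid, mid
    return (dsp_sketch(points[left], lo, hi_l, n_total, theta, min_pts, depth + 1, max_depth)
            + dsp_sketch(points[~left], lo_r, hi, n_total, theta, min_pts, depth + 1, max_depth))

# Example: estimate a 2-D density on [0, 1]^2, then take the centers of the
# densest leaves as k-means seeds, in the spirit of the paper's experiment.
rng = np.random.default_rng(0)
X = np.clip(np.concatenate([rng.normal(0.25, 0.05, (500, 2)),
                            rng.normal(0.75, 0.05, (500, 2))]), 0.0, 1.0)
leaves = dsp_sketch(X, np.zeros(2), np.ones(2), len(X))
centers = [0.5 * (lo + hi) for lo, hi, d in sorted(leaves, key=lambda t: -t[2])[:2]]
```

Seeding k-means from high-density leaf centers is one natural reading of the abstract's initialization claim: cells that survive many splits concentrate probability mass, so their centers tend to land near modes.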